Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 403776 |
| Missing cells | 71286 |
| Missing cells (%) | 1.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 55.5 MiB |
| Average record size in memory | 144.0 B |
Variable types
| Numeric | 15 |
|---|---|
| Categorical | 3 |
REF_NO is highly correlated with year | High correlation |
year is highly correlated with REF_NO | High correlation |
PM2.5 is highly correlated with PM10 and 2 other fields | High correlation |
PM10 is highly correlated with PM2.5 and 2 other fields | High correlation |
SO2 is highly correlated with NO2 and 1 other fields | High correlation |
NO2 is highly correlated with PM2.5 and 3 other fields | High correlation |
CO is highly correlated with PM2.5 and 3 other fields | High correlation |
O3 is highly correlated with TEMP | High correlation |
TEMP is highly correlated with O3 and 2 other fields | High correlation |
PRES is highly correlated with TEMP and 1 other fields | High correlation |
DEWP is highly correlated with TEMP and 1 other fields | High correlation |
REF_NO is highly correlated with year | High correlation |
year is highly correlated with REF_NO | High correlation |
PM2.5 is highly correlated with PM10 and 2 other fields | High correlation |
PM10 is highly correlated with PM2.5 and 2 other fields | High correlation |
SO2 is highly correlated with NO2 and 1 other fields | High correlation |
NO2 is highly correlated with PM2.5 and 4 other fields | High correlation |
CO is highly correlated with PM2.5 and 3 other fields | High correlation |
O3 is highly correlated with NO2 and 1 other fields | High correlation |
TEMP is highly correlated with O3 and 2 other fields | High correlation |
PRES is highly correlated with TEMP and 1 other fields | High correlation |
DEWP is highly correlated with TEMP and 1 other fields | High correlation |
REF_NO is highly correlated with year | High correlation |
year is highly correlated with REF_NO | High correlation |
PM2.5 is highly correlated with PM10 and 1 other fields | High correlation |
PM10 is highly correlated with PM2.5 and 1 other fields | High correlation |
NO2 is highly correlated with CO | High correlation |
CO is highly correlated with PM2.5 and 2 other fields | High correlation |
TEMP is highly correlated with PRES and 1 other fields | High correlation |
PRES is highly correlated with TEMP and 1 other fields | High correlation |
DEWP is highly correlated with TEMP and 1 other fields | High correlation |
month is highly correlated with TEMP and 3 other fields | High correlation |
TEMP is highly correlated with month and 3 other fields | High correlation |
CO is highly correlated with PM10 and 2 other fields | High correlation |
PM10 is highly correlated with CO and 2 other fields | High correlation |
DEWP is highly correlated with month and 3 other fields | High correlation |
PM2.5 is highly correlated with CO and 2 other fields | High correlation |
NO2 is highly correlated with CO and 2 other fields | High correlation |
REF_NO is highly correlated with month and 4 other fields | High correlation |
PRES is highly correlated with month and 3 other fields | High correlation |
year is highly correlated with REF_NO | High correlation |
PM2.5 has 8475 (2.1%) missing values | Missing |
PM10 has 6222 (1.5%) missing values | Missing |
SO2 has 8776 (2.2%) missing values | Missing |
NO2 has 11859 (2.9%) missing values | Missing |
CO has 20261 (5.0%) missing values | Missing |
O3 has 13007 (3.2%) missing values | Missing |
RAIN is highly skewed (γ1 = 29.4402448) | Skewed |
REF_NO is uniformly distributed | Uniform |
station is uniformly distributed | Uniform |
hour has 16824 (4.2%) zeros | Zeros |
RAIN has 387119 (95.9%) zeros | Zeros |
WSPM has 10891 (2.7%) zeros | Zeros |
Reproduction
| Analysis started | 2021-05-27 15:09:18.889390 |
|---|---|
| Analysis finished | 2021-05-27 15:12:59.157415 |
| Duration | 3 minutes and 40.27 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
| Distinct | 33648 |
|---|---|
| Distinct (%) | 8.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16824.5 |
| Minimum | 1 |
|---|---|
| Maximum | 33648 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1683 |
| Q1 | 8412.75 |
| median | 16824.5 |
| Q3 | 25236.25 |
| 95-th percentile | 31966 |
| Maximum | 33648 |
| Range | 33647 |
| Interquartile range (IQR) | 16823.5 |
Descriptive statistics
| Standard deviation | 9713.352953 |
|---|---|
| Coefficient of variation (CV) | 0.5773338258 |
| Kurtosis | -1.200000002 |
| Mean | 16824.5 |
| Median Absolute Deviation (MAD) | 8412 |
| Skewness | 0 |
| Sum | 6793329312 |
| Variance | 94349225.58 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2049 | 12 | < 0.1% |
| 938 | 12 | < 0.1% |
| 5032 | 12 | < 0.1% |
| 27559 | 12 | < 0.1% |
| 25510 | 12 | < 0.1% |
| 31653 | 12 | < 0.1% |
| 29604 | 12 | < 0.1% |
| 19363 | 12 | < 0.1% |
| 17314 | 12 | < 0.1% |
| 23457 | 12 | < 0.1% |
| Other values (33638) | 403656 |
| Value | Count | Frequency (%) |
| 1 | 12 | |
| 2 | 12 | |
| 3 | 12 | |
| 4 | 12 | |
| 5 | 12 | |
| 6 | 12 | |
| 7 | 12 | |
| 8 | 12 | |
| 9 | 12 | |
| 10 | 12 |
| Value | Count | Frequency (%) |
| 33648 | 12 | |
| 33647 | 12 | |
| 33646 | 12 | |
| 33645 | 12 | |
| 33644 | 12 | |
| 33643 | 12 | |
| 33642 | 12 | |
| 33641 | 12 | |
| 33640 | 12 | |
| 33639 | 12 |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.1 MiB |
| 2016 | |
|---|---|
| 2015 | |
| 2014 | |
| 2013 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1615104 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2013 |
|---|---|
| 2nd row | 2013 |
| 3rd row | 2013 |
| 4th row | 2013 |
| 5th row | 2013 |
Common Values
| Value | Count | Frequency (%) |
| 2016 | 105408 | |
| 2015 | 105120 | |
| 2014 | 105120 | |
| 2013 | 88128 |
Length
Pie chart
| Value | Count | Frequency (%) |
| 2016 | 105408 | |
| 2015 | 105120 | |
| 2014 | 105120 | |
| 2013 | 88128 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 403776 | |
| 0 | 403776 | |
| 1 | 403776 | |
| 6 | 105408 | 6.5% |
| 4 | 105120 | 6.5% |
| 5 | 105120 | 6.5% |
| 3 | 88128 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1615104 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 403776 | |
| 0 | 403776 | |
| 1 | 403776 | |
| 6 | 105408 | 6.5% |
| 4 | 105120 | 6.5% |
| 5 | 105120 | 6.5% |
| 3 | 88128 | 5.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1615104 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 403776 | |
| 0 | 403776 | |
| 1 | 403776 | |
| 6 | 105408 | 6.5% |
| 4 | 105120 | 6.5% |
| 5 | 105120 | 6.5% |
| 3 | 88128 | 5.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1615104 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 403776 | |
| 0 | 403776 | |
| 1 | 403776 | |
| 6 | 105408 | 6.5% |
| 4 | 105120 | 6.5% |
| 5 | 105120 | 6.5% |
| 3 | 88128 | 5.5% |
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.735378031 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 7 |
| Q3 | 10 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.356479072 |
|---|---|
| Coefficient of variation (CV) | 0.4983356623 |
| Kurtosis | -1.157296025 |
| Mean | 6.735378031 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.0532691034 |
| Sum | 2719584 |
| Variance | 11.26595176 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 35712 | |
| 5 | 35712 | |
| 7 | 35712 | |
| 8 | 35712 | |
| 10 | 35712 | |
| 12 | 35712 | |
| 4 | 34560 | |
| 6 | 34560 | |
| 9 | 34560 | |
| 11 | 34560 | |
| Other values (2) | 51264 |
| Value | Count | Frequency (%) |
| 1 | 26784 | |
| 2 | 24480 | |
| 3 | 35712 | |
| 4 | 34560 | |
| 5 | 35712 | |
| 6 | 34560 | |
| 7 | 35712 | |
| 8 | 35712 | |
| 9 | 34560 | |
| 10 | 35712 |
| Value | Count | Frequency (%) |
| 12 | 35712 | |
| 11 | 34560 | |
| 10 | 35712 | |
| 9 | 34560 | |
| 8 | 35712 | |
| 7 | 35712 | |
| 6 | 34560 | |
| 5 | 35712 | |
| 4 | 34560 | |
| 3 | 35712 |
day
Real number (ℝ≥0)
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.74821683 |
| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 16 |
| Q3 | 23 |
| 95-th percentile | 29 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.808891484 |
|---|---|
| Coefficient of variation (CV) | 0.5593580262 |
| Kurtosis | -1.195325155 |
| Mean | 15.74821683 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.005682826695 |
| Sum | 6358752 |
| Variance | 77.59656917 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 13248 | 3.3% |
| 2 | 13248 | 3.3% |
| 28 | 13248 | 3.3% |
| 27 | 13248 | 3.3% |
| 26 | 13248 | 3.3% |
| 25 | 13248 | 3.3% |
| 24 | 13248 | 3.3% |
| 23 | 13248 | 3.3% |
| 22 | 13248 | 3.3% |
| 21 | 13248 | 3.3% |
| Other values (21) | 271296 |
| Value | Count | Frequency (%) |
| 1 | 13248 | |
| 2 | 13248 | |
| 3 | 13248 | |
| 4 | 13248 | |
| 5 | 13248 | |
| 6 | 13248 | |
| 7 | 13248 | |
| 8 | 13248 | |
| 9 | 13248 | |
| 10 | 13248 |
| Value | Count | Frequency (%) |
| 31 | 7776 | |
| 30 | 12384 | |
| 29 | 12672 | |
| 28 | 13248 | |
| 27 | 13248 | |
| 26 | 13248 | |
| 25 | 13248 | |
| 24 | 13248 | |
| 23 | 13248 | |
| 22 | 13248 |
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.5 |
| Minimum | 0 |
|---|---|
| Maximum | 23 |
| Zeros | 16824 |
| Zeros (%) | 4.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 5.75 |
| median | 11.5 |
| Q3 | 17.25 |
| 95-th percentile | 22 |
| Maximum | 23 |
| Range | 23 |
| Interquartile range (IQR) | 11.5 |
Descriptive statistics
| Standard deviation | 6.922195124 |
|---|---|
| Coefficient of variation (CV) | 0.6019300108 |
| Kurtosis | -1.204173965 |
| Mean | 11.5 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 0 |
| Sum | 4643424 |
| Variance | 47.91678534 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 16824 | 4.2% |
| 1 | 16824 | 4.2% |
| 22 | 16824 | 4.2% |
| 21 | 16824 | 4.2% |
| 20 | 16824 | 4.2% |
| 19 | 16824 | 4.2% |
| 18 | 16824 | 4.2% |
| 17 | 16824 | 4.2% |
| 16 | 16824 | 4.2% |
| 15 | 16824 | 4.2% |
| Other values (14) | 235536 |
| Value | Count | Frequency (%) |
| 0 | 16824 | |
| 1 | 16824 | |
| 2 | 16824 | |
| 3 | 16824 | |
| 4 | 16824 | |
| 5 | 16824 | |
| 6 | 16824 | |
| 7 | 16824 | |
| 8 | 16824 | |
| 9 | 16824 |
| Value | Count | Frequency (%) |
| 23 | 16824 | |
| 22 | 16824 | |
| 21 | 16824 | |
| 20 | 16824 | |
| 19 | 16824 | |
| 18 | 16824 | |
| 17 | 16824 | |
| 16 | 16824 | |
| 15 | 16824 | |
| 14 | 16824 |
| Distinct | 866 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 8475 |
| Missing (%) | 2.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 79.24827511 |
| Minimum | 2 |
|---|---|
| Maximum | 999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 21 |
| median | 55 |
| Q3 | 110 |
| 95-th percentile | 238 |
| Maximum | 999 |
| Range | 997 |
| Interquartile range (IQR) | 89 |
Descriptive statistics
| Standard deviation | 79.14670837 |
|---|---|
| Coefficient of variation (CV) | 0.9987183728 |
| Kurtosis | 5.728756991 |
| Mean | 79.24827511 |
| Median Absolute Deviation (MAD) | 39 |
| Skewness | 1.974286544 |
| Sum | 31326922.4 |
| Variance | 6264.201445 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 8354 | 2.1% |
| 10 | 6609 | 1.6% |
| 11 | 6418 | 1.6% |
| 9 | 6374 | 1.6% |
| 12 | 6346 | 1.6% |
| 8 | 6333 | 1.6% |
| 13 | 5830 | 1.4% |
| 14 | 5765 | 1.4% |
| 7 | 5742 | 1.4% |
| 6 | 5116 | 1.3% |
| Other values (856) | 332414 | |
| (Missing) | 8475 | 2.1% |
| Value | Count | Frequency (%) |
| 2 | 7 | < 0.1% |
| 3 | 8354 | |
| 4 | 3221 | 0.8% |
| 4.3 | 2 | < 0.1% |
| 4.4 | 1 | < 0.1% |
| 4.6 | 1 | < 0.1% |
| 5 | 3984 | |
| 6 | 5116 | |
| 7 | 5742 | |
| 7.2 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 999 | 1 | |
| 957 | 1 | |
| 941 | 1 | |
| 898 | 1 | |
| 882 | 1 | |
| 881 | 1 | |
| 857 | 1 | |
| 844 | 1 | |
| 826 | 1 | |
| 821 | 1 |
| Distinct | 1048 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 6222 |
| Missing (%) | 1.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 104.3278973 |
| Minimum | 2 |
|---|---|
| Maximum | 999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 10 |
| Q1 | 36 |
| median | 83 |
| Q3 | 145 |
| 95-th percentile | 277 |
| Maximum | 999 |
| Range | 997 |
| Interquartile range (IQR) | 109 |
Descriptive statistics
| Standard deviation | 90.13639956 |
|---|---|
| Coefficient of variation (CV) | 0.8639721671 |
| Kurtosis | 5.737014891 |
| Mean | 104.3278973 |
| Median Absolute Deviation (MAD) | 52 |
| Skewness | 1.816481632 |
| Sum | 41475972.9 |
| Variance | 8124.570525 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6 | 4712 | 1.2% |
| 5 | 3547 | 0.9% |
| 18 | 3523 | 0.9% |
| 14 | 3493 | 0.9% |
| 16 | 3405 | 0.8% |
| 17 | 3383 | 0.8% |
| 13 | 3349 | 0.8% |
| 20 | 3336 | 0.8% |
| 24 | 3240 | 0.8% |
| 21 | 3229 | 0.8% |
| Other values (1038) | 362337 | |
| (Missing) | 6222 | 1.5% |
| Value | Count | Frequency (%) |
| 2 | 103 | < 0.1% |
| 3 | 719 | 0.2% |
| 4 | 264 | 0.1% |
| 5 | 3547 | |
| 5.4 | 2 | < 0.1% |
| 5.6 | 1 | < 0.1% |
| 6 | 4712 | |
| 6.4 | 1 | < 0.1% |
| 6.6 | 1 | < 0.1% |
| 7 | 2245 |
| Value | Count | Frequency (%) |
| 999 | 3 | |
| 995 | 1 | < 0.1% |
| 993 | 1 | < 0.1% |
| 992 | 1 | < 0.1% |
| 991 | 1 | < 0.1% |
| 988 | 1 | < 0.1% |
| 987 | 1 | < 0.1% |
| 986 | 1 | < 0.1% |
| 984 | 1 | < 0.1% |
| 983 | 1 | < 0.1% |
| Distinct | 685 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 8776 |
| Missing (%) | 2.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.73305999 |
| Minimum | 0.2856 |
|---|---|
| Maximum | 500 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 0.2856 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 2 |
| median | 7 |
| Q3 | 19 |
| 95-th percentile | 61 |
| Maximum | 500 |
| Range | 499.7144 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 21.73945549 |
|---|---|
| Coefficient of variation (CV) | 1.381769058 |
| Kurtosis | 14.00498947 |
| Mean | 15.73305999 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 3.007737087 |
| Sum | 6214558.695 |
| Variance | 472.6039248 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 97027 | |
| 3 | 31771 | 7.9% |
| 4 | 20810 | 5.2% |
| 5 | 17091 | 4.2% |
| 6 | 15762 | 3.9% |
| 7 | 13639 | 3.4% |
| 8 | 12722 | 3.2% |
| 9 | 10952 | 2.7% |
| 10 | 10096 | 2.5% |
| 11 | 8863 | 2.2% |
| Other values (675) | 156267 | |
| (Missing) | 8776 | 2.2% |
| Value | Count | Frequency (%) |
| 0.2856 | 89 | < 0.1% |
| 0.5712 | 70 | < 0.1% |
| 0.8568 | 72 | < 0.1% |
| 1 | 3221 | 0.8% |
| 1.1424 | 84 | < 0.1% |
| 1.428 | 94 | < 0.1% |
| 1.7136 | 83 | < 0.1% |
| 1.9992 | 110 | < 0.1% |
| 2 | 97027 | |
| 2.1 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 500 | 3 | |
| 411 | 1 | < 0.1% |
| 341 | 1 | < 0.1% |
| 315 | 1 | < 0.1% |
| 314 | 1 | < 0.1% |
| 310 | 1 | < 0.1% |
| 299 | 1 | < 0.1% |
| 282 | 1 | < 0.1% |
| 278 | 1 | < 0.1% |
| 277 | 1 | < 0.1% |
| Distinct | 1209 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 11859 |
| Missing (%) | 2.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50.35278459 |
| Minimum | 1.0265 |
|---|---|
| Maximum | 290 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 1.0265 |
|---|---|
| 5-th percentile | 8 |
| Q1 | 23 |
| median | 43 |
| Q3 | 71 |
| 95-th percentile | 116 |
| Maximum | 290 |
| Range | 288.9735 |
| Interquartile range (IQR) | 48 |
Descriptive statistics
| Standard deviation | 34.77190967 |
|---|---|
| Coefficient of variation (CV) | 0.6905657741 |
| Kurtosis | 1.211420478 |
| Mean | 50.35278459 |
| Median Absolute Deviation (MAD) | 23 |
| Skewness | 1.052701359 |
| Sum | 19734112.28 |
| Variance | 1209.085702 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 16 | 5572 | 1.4% |
| 22 | 5556 | 1.4% |
| 20 | 5523 | 1.4% |
| 17 | 5467 | 1.4% |
| 18 | 5441 | 1.3% |
| 26 | 5420 | 1.3% |
| 21 | 5416 | 1.3% |
| 19 | 5368 | 1.3% |
| 14 | 5366 | 1.3% |
| 24 | 5358 | 1.3% |
| Other values (1199) | 337430 | |
| (Missing) | 11859 | 2.9% |
| Value | Count | Frequency (%) |
| 1.0265 | 3 | < 0.1% |
| 1.2318 | 2 | < 0.1% |
| 1.4371 | 2 | < 0.1% |
| 1.6424 | 3 | < 0.1% |
| 1.8477 | 1 | < 0.1% |
| 2 | 4364 | |
| 2.053 | 1 | < 0.1% |
| 2.2583 | 3 | < 0.1% |
| 2.4636 | 1 | < 0.1% |
| 2.6689 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 290 | 1 | |
| 285 | 1 | |
| 280 | 1 | |
| 277 | 2 | |
| 273 | 1 | |
| 270 | 1 | |
| 269 | 1 | |
| 265 | 1 | |
| 264 | 1 | |
| 263 | 2 |
| Distinct | 132 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 20261 |
| Missing (%) | 5.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1214.843339 |
| Minimum | 100 |
|---|---|
| Maximum | 10000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 100 |
|---|---|
| 5-th percentile | 200 |
| Q1 | 500 |
| median | 900 |
| Q3 | 1500 |
| 95-th percentile | 3400 |
| Maximum | 10000 |
| Range | 9900 |
| Interquartile range (IQR) | 1000 |
Descriptive statistics
| Standard deviation | 1124.285676 |
|---|---|
| Coefficient of variation (CV) | 0.925457333 |
| Kurtosis | 9.450258696 |
| Mean | 1214.843339 |
| Median Absolute Deviation (MAD) | 500 |
| Skewness | 2.56066181 |
| Sum | 465910643 |
| Variance | 1264018.282 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 300 | 30662 | 7.6% |
| 400 | 29849 | 7.4% |
| 500 | 28043 | 6.9% |
| 600 | 27189 | 6.7% |
| 700 | 25720 | 6.4% |
| 800 | 22728 | 5.6% |
| 900 | 20655 | 5.1% |
| 1000 | 19026 | 4.7% |
| 200 | 17370 | 4.3% |
| 1100 | 17009 | 4.2% |
| Other values (122) | 145264 | |
| (Missing) | 20261 | 5.0% |
| Value | Count | Frequency (%) |
| 100 | 5091 | 1.3% |
| 150 | 1 | < 0.1% |
| 200 | 17370 | |
| 300 | 30662 | |
| 350 | 1 | < 0.1% |
| 400 | 29849 | |
| 500 | 28043 | |
| 600 | 27189 | |
| 700 | 25720 | |
| 800 | 22728 |
| Value | Count | Frequency (%) |
| 10000 | 51 | |
| 9900 | 25 | |
| 9800 | 24 | |
| 9700 | 23 | |
| 9600 | 23 | |
| 9500 | 22 | |
| 9400 | 25 | |
| 9300 | 31 | |
| 9200 | 31 | |
| 9100 | 31 |
| Distinct | 1597 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 13007 |
| Missing (%) | 3.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 58.11932675 |
| Minimum | 0.2142 |
|---|---|
| Maximum | 1071 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 0.2142 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 11 |
| median | 45 |
| Q3 | 83 |
| 95-th percentile | 180 |
| Maximum | 1071 |
| Range | 1070.7858 |
| Interquartile range (IQR) | 72 |
Descriptive statistics
| Standard deviation | 57.37596606 |
|---|---|
| Coefficient of variation (CV) | 0.9872097505 |
| Kurtosis | 6.074069635 |
| Mean | 58.11932675 |
| Median Absolute Deviation (MAD) | 36 |
| Skewness | 1.635163683 |
| Sum | 22711231.2 |
| Variance | 3292.001482 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 40544 | 10.0% |
| 3 | 8245 | 2.0% |
| 4 | 7636 | 1.9% |
| 1 | 6878 | 1.7% |
| 5 | 6129 | 1.5% |
| 6 | 5641 | 1.4% |
| 8 | 4796 | 1.2% |
| 7 | 4642 | 1.1% |
| 10 | 3940 | 1.0% |
| 9 | 3936 | 1.0% |
| Other values (1587) | 298382 | |
| (Missing) | 13007 | 3.2% |
| Value | Count | Frequency (%) |
| 0.2142 | 134 | < 0.1% |
| 0.4284 | 119 | < 0.1% |
| 0.6426 | 118 | < 0.1% |
| 0.8568 | 120 | < 0.1% |
| 1 | 6878 | |
| 1.071 | 138 | < 0.1% |
| 1.2852 | 147 | < 0.1% |
| 1.4994 | 166 | < 0.1% |
| 1.7136 | 125 | < 0.1% |
| 1.9278 | 147 | < 0.1% |
| Value | Count | Frequency (%) |
| 1071 | 14 | |
| 1050 | 1 | < 0.1% |
| 1026 | 1 | < 0.1% |
| 674 | 1 | < 0.1% |
| 673 | 1 | < 0.1% |
| 500 | 5 | < 0.1% |
| 450 | 1 | < 0.1% |
| 444 | 1 | < 0.1% |
| 432 | 1 | < 0.1% |
| 429 | 1 | < 0.1% |
| Distinct | 1180 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 264 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14.08889947 |
| Minimum | -19.9 |
|---|---|
| Maximum | 41.6 |
| Zeros | 2642 |
| Zeros (%) | 0.7% |
| Negative | 55474 |
| Negative (%) | 13.7% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | -19.9 |
|---|---|
| 5-th percentile | -4 |
| Q1 | 4 |
| median | 15.4 |
| Q3 | 23.5 |
| 95-th percentile | 30.7 |
| Maximum | 41.6 |
| Range | 61.5 |
| Interquartile range (IQR) | 19.5 |
Descriptive statistics
| Standard deviation | 11.30353352 |
|---|---|
| Coefficient of variation (CV) | 0.802300672 |
| Kurtosis | -1.087420248 |
| Mean | 14.08889947 |
| Median Absolute Deviation (MAD) | 9.4 |
| Skewness | -0.1686978359 |
| Sum | 5685040.005 |
| Variance | 127.76987 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 3342 | 0.8% |
| 1 | 2796 | 0.7% |
| 0 | 2642 | 0.7% |
| 2 | 2556 | 0.6% |
| -1 | 2436 | 0.6% |
| -2 | 2293 | 0.6% |
| -4 | 1844 | 0.5% |
| 4 | 1772 | 0.4% |
| 5 | 1680 | 0.4% |
| -5 | 1633 | 0.4% |
| Other values (1170) | 380518 |
| Value | Count | Frequency (%) |
| -19.9 | 1 | |
| -19.7 | 1 | |
| -19.5 | 1 | |
| -18.9 | 1 | |
| -18.7 | 1 | |
| -18.5 | 1 | |
| -18.1 | 1 | |
| -17.9 | 1 | |
| -17.4 | 1 | |
| -17.3 | 1 |
| Value | Count | Frequency (%) |
| 41.6 | 1 | < 0.1% |
| 41.4 | 2 | < 0.1% |
| 41.1 | 3 | < 0.1% |
| 41 | 2 | < 0.1% |
| 40.9 | 1 | < 0.1% |
| 40.6 | 2 | < 0.1% |
| 40.5 | 8 | |
| 40.4 | 3 | < 0.1% |
| 40.3 | 4 | |
| 40.2 | 2 | < 0.1% |
| Distinct | 675 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 265 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1010.282534 |
| Minimum | 982.4 |
|---|---|
| Maximum | 1042.8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 982.4 |
|---|---|
| 5-th percentile | 994.6 |
| Q1 | 1002 |
| median | 1009.8 |
| Q3 | 1018.3 |
| 95-th percentile | 1027.4 |
| Maximum | 1042.8 |
| Range | 60.4 |
| Interquartile range (IQR) | 16.3 |
Descriptive statistics
| Standard deviation | 10.35677799 |
|---|---|
| Coefficient of variation (CV) | 0.01025136795 |
| Kurtosis | -0.7829195448 |
| Mean | 1010.282534 |
| Median Absolute Deviation (MAD) | 8.2 |
| Skewness | 0.1519478448 |
| Sum | 407660115.7 |
| Variance | 107.2628503 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1019 | 2712 | 0.7% |
| 1018 | 2695 | 0.7% |
| 1021 | 2691 | 0.7% |
| 1015 | 2602 | 0.6% |
| 1023 | 2596 | 0.6% |
| 1020 | 2570 | 0.6% |
| 1017 | 2554 | 0.6% |
| 1016 | 2528 | 0.6% |
| 1022 | 2474 | 0.6% |
| 1024 | 2455 | 0.6% |
| Other values (665) | 377634 |
| Value | Count | Frequency (%) |
| 982.4 | 2 | < 0.1% |
| 982.7 | 2 | < 0.1% |
| 982.8 | 3 | |
| 982.9 | 2 | < 0.1% |
| 983 | 4 | |
| 983.2 | 4 | |
| 983.3 | 3 | |
| 983.4 | 2 | < 0.1% |
| 983.5 | 6 | |
| 983.6 | 4 |
| Value | Count | Frequency (%) |
| 1042.8 | 2 | < 0.1% |
| 1042.4 | 1 | < 0.1% |
| 1042.3 | 2 | < 0.1% |
| 1042.2 | 1 | < 0.1% |
| 1042 | 11 | |
| 1041.8 | 8 | |
| 1041.7 | 1 | < 0.1% |
| 1041.6 | 7 | |
| 1041.5 | 2 | < 0.1% |
| 1041.4 | 8 |
| Distinct | 645 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 269 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.157291447 |
| Minimum | -43.4 |
|---|---|
| Maximum | 29.1 |
| Zeros | 828 |
| Zeros (%) | 0.2% |
| Negative | 168595 |
| Negative (%) | 41.8% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | -43.4 |
|---|---|
| 5-th percentile | -19.4 |
| Q1 | -8 |
| median | 4.2 |
| Q3 | 15.5 |
| 95-th percentile | 22.2 |
| Maximum | 29.1 |
| Range | 72.5 |
| Interquartile range (IQR) | 23.5 |
Descriptive statistics
| Standard deviation | 13.61727272 |
|---|---|
| Coefficient of variation (CV) | 4.312960315 |
| Kurtosis | -1.078189528 |
| Mean | 3.157291447 |
| Median Absolute Deviation (MAD) | 11.6 |
| Skewness | -0.2500222557 |
| Sum | 1273989.2 |
| Variance | 185.4301162 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 17.6 | 1559 | 0.4% |
| 17 | 1519 | 0.4% |
| 17.2 | 1490 | 0.4% |
| 16.8 | 1483 | 0.4% |
| 17.3 | 1455 | 0.4% |
| 17.1 | 1445 | 0.4% |
| 17.8 | 1440 | 0.4% |
| 16.2 | 1429 | 0.4% |
| 18.2 | 1426 | 0.4% |
| 17.5 | 1409 | 0.3% |
| Other values (635) | 388852 |
| Value | Count | Frequency (%) |
| -43.4 | 1 | < 0.1% |
| -36 | 1 | < 0.1% |
| -35.7 | 1 | < 0.1% |
| -35.5 | 1 | < 0.1% |
| -35.3 | 7 | |
| -35.1 | 9 | |
| -35 | 6 | |
| -34.9 | 2 | < 0.1% |
| -34.8 | 7 | |
| -34.6 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 29.1 | 2 | < 0.1% |
| 29 | 1 | < 0.1% |
| 28.8 | 10 | |
| 28.7 | 12 | |
| 28.6 | 2 | < 0.1% |
| 28.5 | 12 | |
| 28.4 | 14 | |
| 28.3 | 14 | |
| 28.2 | 9 | |
| 28.1 | 9 |
| Distinct | 253 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 261 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.06705178246 |
| Minimum | 0 |
|---|---|
| Maximum | 72.5 |
| Zeros | 387119 |
| Zeros (%) | 95.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 72.5 |
| Range | 72.5 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.8378448668 |
|---|---|
| Coefficient of variation (CV) | 12.49548984 |
| Kurtosis | 1291.908304 |
| Mean | 0.06705178246 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 29.4402448 |
| Sum | 27056.4 |
| Variance | 0.7019840208 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 387119 | |
| 0.1 | 3689 | 0.9% |
| 0.2 | 1823 | 0.5% |
| 0.3 | 1374 | 0.3% |
| 0.4 | 885 | 0.2% |
| 0.5 | 847 | 0.2% |
| 0.6 | 698 | 0.2% |
| 0.7 | 585 | 0.1% |
| 0.9 | 502 | 0.1% |
| 0.8 | 482 | 0.1% |
| Other values (243) | 5511 | 1.4% |
| Value | Count | Frequency (%) |
| 0 | 387119 | |
| 0.1 | 3689 | 0.9% |
| 0.2 | 1823 | 0.5% |
| 0.3 | 1374 | 0.3% |
| 0.4 | 885 | 0.2% |
| 0.5 | 847 | 0.2% |
| 0.6 | 698 | 0.2% |
| 0.7 | 585 | 0.1% |
| 0.8 | 482 | 0.1% |
| 0.9 | 502 | 0.1% |
| Value | Count | Frequency (%) |
| 72.5 | 3 | |
| 52.1 | 2 | < 0.1% |
| 47.7 | 1 | < 0.1% |
| 46.4 | 6 | |
| 45.9 | 2 | < 0.1% |
| 41.9 | 1 | < 0.1% |
| 40.7 | 3 | |
| 39 | 1 | < 0.1% |
| 38.9 | 1 | < 0.1% |
| 37.4 | 2 | < 0.1% |
wd
Categorical
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1389 |
| Missing (%) | 0.3% |
| Memory size | 3.1 MiB |
| NE | |
|---|---|
| ENE | |
| N | |
| NW | |
| E | |
| Other values (11) |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.238176184 |
| Min length | 1 |
Characters and Unicode
| Total characters | 900613 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NNW |
|---|---|
| 2nd row | N |
| 3rd row | NNW |
| 4th row | NW |
| 5th row | N |
Common Values
| Value | Count | Frequency (%) |
| NE | 40049 | 9.9% |
| ENE | 33262 | 8.2% |
| N | 29973 | 7.4% |
| NW | 29587 | 7.3% |
| E | 29168 | 7.2% |
| NNE | 27247 | 6.7% |
| SW | 27083 | 6.7% |
| NNW | 24167 | 6.0% |
| WNW | 23815 | 5.9% |
| ESE | 23691 | 5.9% |
| Other values (6) | 114345 |
Length
| Value | Count | Frequency (%) |
| ne | 40049 | 10.0% |
| ene | 33262 | 8.3% |
| n | 29973 | 7.4% |
| nw | 29587 | 7.4% |
| e | 29168 | 7.2% |
| nne | 27247 | 6.8% |
| sw | 27083 | 6.7% |
| nnw | 24167 | 6.0% |
| wnw | 23815 | 5.9% |
| ese | 23691 | 5.9% |
| Other values (6) | 114345 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 259514 | |
| E | 246725 | |
| W | 207045 | |
| S | 187329 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 900613 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 259514 | |
| E | 246725 | |
| W | 207045 | |
| S | 187329 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 900613 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 259514 | |
| E | 246725 | |
| W | 207045 | |
| S | 187329 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 900613 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 259514 | |
| E | 246725 | |
| W | 207045 | |
| S | 187329 |
| Distinct | 115 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 238 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.718379682 |
| Minimum | 0 |
|---|---|
| Maximum | 13.2 |
| Zeros | 10891 |
| Zeros (%) | 2.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.3 |
| Q1 | 0.9 |
| median | 1.4 |
| Q3 | 2.2 |
| 95-th percentile | 4.2 |
| Maximum | 13.2 |
| Range | 13.2 |
| Interquartile range (IQR) | 1.3 |
Descriptive statistics
| Standard deviation | 1.237964878 |
|---|---|
| Coefficient of variation (CV) | 0.7204256958 |
| Kurtosis | 3.691546729 |
| Mean | 1.718379682 |
| Median Absolute Deviation (MAD) | 0.6 |
| Skewness | 1.625270041 |
| Sum | 693431.5 |
| Variance | 1.532557039 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.1 | 21486 | 5.3% |
| 1 | 21370 | 5.3% |
| 1.2 | 21228 | 5.3% |
| 0.9 | 20237 | 5.0% |
| 1.3 | 19640 | 4.9% |
| 0.8 | 18585 | 4.6% |
| 1.4 | 17776 | 4.4% |
| 0.7 | 16969 | 4.2% |
| 1.5 | 16273 | 4.0% |
| 1.6 | 15098 | 3.7% |
| Other values (105) | 214876 |
| Value | Count | Frequency (%) |
| 0 | 10891 | |
| 0.1 | 4175 | 1.0% |
| 0.2 | 4378 | 1.1% |
| 0.3 | 2673 | 0.7% |
| 0.4 | 7154 | 1.8% |
| 0.5 | 10842 | |
| 0.6 | 13881 | |
| 0.7 | 16969 | |
| 0.8 | 18585 | |
| 0.9 | 20237 |
| Value | Count | Frequency (%) |
| 13.2 | 1 | < 0.1% |
| 12.9 | 1 | < 0.1% |
| 12.8 | 1 | < 0.1% |
| 11.8 | 1 | < 0.1% |
| 11.7 | 1 | < 0.1% |
| 11.2 | 3 | |
| 11 | 1 | < 0.1% |
| 10.9 | 3 | |
| 10.7 | 1 | < 0.1% |
| 10.5 | 3 |
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.1 MiB |
| Wanshouxigong | |
|---|---|
| Gucheng | |
| Nongzhanguan | |
| Tiantan | |
| Shunyi | |
| Other values (7) |
Length
| Max length | 13 |
|---|---|
| Median length | 7.5 |
| Mean length | 8.416666667 |
| Min length | 6 |
Characters and Unicode
| Total characters | 3398448 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Aotizhongxin |
|---|---|
| 2nd row | Aotizhongxin |
| 3rd row | Aotizhongxin |
| 4th row | Aotizhongxin |
| 5th row | Aotizhongxin |
Common Values
| Value | Count | Frequency (%) |
| Wanshouxigong | 33648 | |
| Gucheng | 33648 | |
| Nongzhanguan | 33648 | |
| Tiantan | 33648 | |
| Shunyi | 33648 | |
| Dingling | 33648 | |
| Guanyuan | 33648 | |
| Dongsi | 33648 | |
| Wanliu | 33648 | |
| Huairou | 33648 | |
| Other values (2) | 67296 |
Length
| Value | Count | Frequency (%) |
| guanyuan | 33648 | |
| dongsi | 33648 | |
| dingling | 33648 | |
| tiantan | 33648 | |
| gucheng | 33648 | |
| nongzhanguan | 33648 | |
| wanliu | 33648 | |
| wanshouxigong | 33648 | |
| aotizhongxin | 33648 | |
| changping | 33648 | |
| Other values (2) | 67296 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 639312 | |
| i | 370128 | |
| g | 370128 | |
| a | 336480 | |
| u | 302832 | |
| o | 235536 | 6.9% |
| h | 201888 | 5.9% |
| t | 67296 | 2.0% |
| z | 67296 | 2.0% |
| x | 67296 | 2.0% |
| Other values (16) | 740256 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2994672 | |
| Uppercase Letter | 403776 | 11.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 639312 | |
| i | 370128 | |
| g | 370128 | |
| a | 336480 | |
| u | 302832 | |
| o | 235536 | 7.9% |
| h | 201888 | 6.7% |
| t | 67296 | 2.2% |
| z | 67296 | 2.2% |
| x | 67296 | 2.2% |
| Other values (7) | 336480 |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 67296 | |
| G | 67296 | |
| W | 67296 | |
| A | 33648 | |
| C | 33648 | |
| H | 33648 | |
| N | 33648 | |
| S | 33648 | |
| T | 33648 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3398448 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 639312 | |
| i | 370128 | |
| g | 370128 | |
| a | 336480 | |
| u | 302832 | |
| o | 235536 | 6.9% |
| h | 201888 | 5.9% |
| t | 67296 | 2.0% |
| z | 67296 | 2.0% |
| x | 67296 | 2.0% |
| Other values (16) | 740256 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3398448 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 639312 | |
| i | 370128 | |
| g | 370128 | |
| a | 336480 | |
| u | 302832 | |
| o | 235536 | 6.9% |
| h | 201888 | 5.9% |
| t | 67296 | 2.0% |
| z | 67296 | 2.0% |
| x | 67296 | 2.0% |
| Other values (16) | 740256 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| REF_NO | year | month | day | hour | PM2.5 | PM10 | SO2 | NO2 | CO | O3 | TEMP | PRES | DEWP | RAIN | wd | WSPM | station | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 2013 | 3 | 1 | 0 | 4.00000 | 4.00000 | 4.00000 | 7.00000 | 300.00000 | 77.00000 | -0.70000 | 1023.00000 | -18.80000 | 0.00000 | NNW | 4.40000 | Aotizhongxin |
| 1 | 2 | 2013 | 3 | 1 | 1 | 8.00000 | 8.00000 | 4.00000 | 7.00000 | 300.00000 | 77.00000 | -1.10000 | 1023.20000 | -18.20000 | 0.00000 | N | 4.70000 | Aotizhongxin |
| 2 | 3 | 2013 | 3 | 1 | 2 | 7.00000 | 7.00000 | 5.00000 | 10.00000 | 300.00000 | 73.00000 | -1.10000 | 1023.50000 | -18.20000 | 0.00000 | NNW | 5.60000 | Aotizhongxin |
| 3 | 4 | 2013 | 3 | 1 | 3 | 6.00000 | 6.00000 | 11.00000 | 11.00000 | 300.00000 | 72.00000 | -1.40000 | 1024.50000 | -19.40000 | 0.00000 | NW | 3.10000 | Aotizhongxin |
| 4 | 5 | 2013 | 3 | 1 | 4 | 3.00000 | 3.00000 | 12.00000 | 12.00000 | 300.00000 | 72.00000 | -2.00000 | 1025.20000 | -19.50000 | 0.00000 | N | 2.00000 | Aotizhongxin |
| 5 | 6 | 2013 | 3 | 1 | 5 | 5.00000 | 5.00000 | 18.00000 | 18.00000 | 400.00000 | 66.00000 | -2.20000 | 1025.60000 | -19.60000 | 0.00000 | N | 3.70000 | Aotizhongxin |
| 6 | 7 | 2013 | 3 | 1 | 6 | 3.00000 | 3.00000 | 18.00000 | 32.00000 | 500.00000 | 50.00000 | -2.60000 | 1026.50000 | -19.10000 | 0.00000 | NNE | 2.50000 | Aotizhongxin |
| 7 | 8 | 2013 | 3 | 1 | 7 | 3.00000 | 6.00000 | 19.00000 | 41.00000 | 500.00000 | 43.00000 | -1.60000 | 1027.40000 | -19.10000 | 0.00000 | NNW | 3.80000 | Aotizhongxin |
| 8 | 9 | 2013 | 3 | 1 | 8 | 3.00000 | 6.00000 | 16.00000 | 43.00000 | 500.00000 | 45.00000 | 0.10000 | 1028.30000 | -19.20000 | 0.00000 | NNW | 4.10000 | Aotizhongxin |
| 9 | 10 | 2013 | 3 | 1 | 9 | 3.00000 | 8.00000 | 12.00000 | 28.00000 | 400.00000 | 59.00000 | 1.20000 | 1028.50000 | -19.30000 | 0.00000 | N | 2.60000 | Aotizhongxin |
Last rows
| REF_NO | year | month | day | hour | PM2.5 | PM10 | SO2 | NO2 | CO | O3 | TEMP | PRES | DEWP | RAIN | wd | WSPM | station | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 403766 | 33639 | 2016 | 12 | 31 | 14 | 399.00000 | 412.00000 | 31.00000 | 198.00000 | 4900.00000 | 6.00000 | 3.80000 | 1021.90000 | -8.90000 | 0.00000 | SSE | 1.00000 | Wanshouxigong |
| 403767 | 33640 | 2016 | 12 | 31 | 15 | 449.00000 | 524.00000 | 30.00000 | 217.00000 | 5600.00000 | 8.00000 | 3.90000 | 1021.50000 | -6.10000 | 0.00000 | S | 1.40000 | Wanshouxigong |
| 403768 | 33641 | 2016 | 12 | 31 | 16 | 440.00000 | 440.00000 | 26.00000 | 200.00000 | 4700.00000 | 6.00000 | 2.80000 | 1021.50000 | -6.60000 | 0.00000 | SSE | 0.70000 | Wanshouxigong |
| 403769 | 33642 | 2016 | 12 | 31 | 17 | 378.00000 | 378.00000 | 20.00000 | 171.00000 | 3800.00000 | 4.00000 | 1.20000 | 1021.40000 | -5.50000 | 0.00000 | SSE | 1.10000 | Wanshouxigong |
| 403770 | 33643 | 2016 | 12 | 31 | 18 | 392.00000 | 458.00000 | 14.00000 | 160.00000 | 3900.00000 | 3.00000 | -1.30000 | 1021.90000 | -6.50000 | 0.00000 | S | 0.60000 | Wanshouxigong |
| 403771 | 33644 | 2016 | 12 | 31 | 19 | 449.00000 | 487.00000 | 10.00000 | 153.00000 | 4500.00000 | 4.00000 | -1.90000 | 1022.00000 | -6.10000 | 0.00000 | ESE | 0.90000 | Wanshouxigong |
| 403772 | 33645 | 2016 | 12 | 31 | 20 | 460.00000 | 492.00000 | 12.00000 | 146.00000 | 4100.00000 | 4.00000 | -2.50000 | 1022.40000 | -5.50000 | 0.00000 | ENE | 0.70000 | Wanshouxigong |
| 403773 | 33646 | 2016 | 12 | 31 | 21 | 463.00000 | 498.00000 | 12.00000 | 141.00000 | 4400.00000 | 5.00000 | -3.00000 | 1022.10000 | -5.30000 | 0.00000 | E | 0.90000 | Wanshouxigong |
| 403774 | 33647 | 2016 | 12 | 31 | 22 | 493.00000 | 537.00000 | 12.00000 | 124.00000 | 5000.00000 | 8.00000 | -3.00000 | 1022.70000 | -5.00000 | 0.00000 | SW | 0.10000 | Wanshouxigong |
| 403775 | 33648 | 2016 | 12 | 31 | 23 | 464.00000 | 490.00000 | 8.00000 | 111.00000 | 5400.00000 | 7.00000 | -4.00000 | 1022.60000 | -5.70000 | 0.00000 | ENE | 0.90000 | Wanshouxigong |